137 research outputs found

    Accounting for epistatic interactions improves the functional analysis of protein structures

    Get PDF
    Motivation: The constraints under which sequence, structure and function coevolve are not fully understood. Bringing this mutual relationship to light can reveal the molecular basis of binding, catalysis and allostery, thereby identifying function and rationally guiding protein redesign. Underlying these relationships are the epistatic interactions that occur when the consequences of a mutation to a protein are determined by the genetic background in which it occurs. Based on prior data, we hypothesize that epistatic forces operate most strongly between residues nearby in the structure, resulting in smooth evolutionary importance across the structure. Methods and Results: We find that when residue scores of evolutionary importance are distributed smoothly between nearby residues, functional site prediction accuracy improves. Accordingly, we designed a novel measure of evolutionary importance that focuses on the interaction between pairs of structurally neighboring residues. This measure that we term pair-interaction Evolutionary Trace yields greater functional site overlap and better structure-based proteome-wide functional predictions. Conclusions: Our data show that the structural smoothness of evolutionary importance is a fundamental feature of the coevolution of sequence, structure and function. Mutations operate on individual residues, but selective pressure depends in part on the extent to which a mutation perturbs interactions with neighboring residues. In practice, this principle led us to redefine the importance of a residue in terms of the importance of its epistatic interactions with neighbors, yielding better annotation of functional residues, motivating experimental validation of a novel functional site in LexA and refining protein function prediction. Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics online

    Structure and evolutionary trace-assisted screening of a residue swapping the substrate ambiguity and chiral specificity in an esterase

    Get PDF
    11 pags., 6figs., 3 pags.Our understanding of enzymes with high substrate ambiguity remains limited because their large active sites allow substrate docking freedom to an extent that seems incompatible with stereospecificity. One possibility is that some of these enzymes evolved a set of evolutionarily fitted sequence positions that stringently allow switching substrate ambiguity and chiral specificity. To explore this hypothesis, we targeted for mutation a serine ester hydrolase (EH) that exhibits an impressive 71-substrate repertoire but is not stereospecific (e.e. 50%). We used structural actions and the computational evolutionary trace method to explore specificity-swapping sequence positions and hypothesized that position I244 was critical. Driven by evolutionary action analysis, this position was substituted to leucine, which together with isoleucine appears to be the amino acid most commonly present in the closest homologous sequences (max. identity, ca. 67.1%), and to phenylalanine, which appears in distant homologues. While the I244L mutation did not have any functional consequences, the I244F mutation allowed the esterase to maintain a remarkable 53-substrate range while gaining stereospecificity properties (e.e. 99.99%). These data support the possibility that some enzymes evolve sequence positions that control the substrate scope and stereospecificity. Such residues, which can be evolutionarily screened, may serve as starting points for further designing substrate-ambiguous, yet chiral-specific, enzymes that are greatly appreciated in biotechnology and synthetic chemistry.MF acknowledges the grant ‘INMARE’ from the EuropeanUnion’s Horizon 2020 (grant agreement no. 634486), the grantsPCIN-2017-078 (within the Marine Biotechnology ERA-NET) and BIO2017-85522-R from the Ministerio de Economía, Industria y Competitividad, Agencia Estatal de Investigación (AEI), Fondo Eur-opeo de Desarrollo Regional (FEDER) and the European Union (EU),and the grant 2020AEP061 from the Agencia Estatal CSIC. J.S-A.acknowledges grant PID2019-105838RB-C33 from the Ministeriode Ciencia e Innovación, Agencia Estatal de Investigación (AEI),Fondo Europeo de Desarrollo Regional (FEDER) and the EuropeanUnion (EU). P.N.G. acknowledges the support of the Era-Net IB Pro-ject MetaCat funded through UK Biotechnology and BiologicalSciences Research Council (BBSRC), grant No. BB/M029085/1, andthe Centre for Environmental Biotechnology Project, co-fundedby European Regional Development Fund (ERDF) via the WelshGovernment (WEFO); R.B. acknowledges the Supercomputing Wales project, co-funded by ERDF via WEFO. OL and PK were sup-ported by the National Institutes of Health (NIH) grants 5R01AG061105, 5R01GM066099, and 5R01GM079656. C. Coscolínthanks the Ministerio de Economía y Competitividad and FEDER fora PhD fellowship (Grant BES-2015-073829). Staff of the Synchrotron Radiation Source at Alba (Barcelona, Spain) for assistance at the BL13-XALOC beamlin

    Exploring use of unsupervised clustering to associate signaling profiles of GPCR ligands to clinical response.

    Get PDF
    Signaling diversity of G protein-coupled (GPCR) ligands provides novel opportunities to develop more effective, better-tolerated therapeutics. Taking advantage of these opportunities requires identifying which effectors should be specifically activated or avoided so as to promote desired clinical responses and avoid side effects. However, identifying signaling profiles that support desired clinical outcomes remains challenging. This study describes signaling diversity of mu opioid receptor (MOR) ligands in terms of logistic and operational parameters for ten different in vitro readouts. It then uses unsupervised clustering of curve parameters to: classify MOR ligands according to similarities in type and magnitude of response, associate resulting ligand categories with frequency of undesired events reported to the pharmacovigilance program of the Food and Drug Administration and associate signals to side effects. The ability of the classification method to associate specific in vitro signaling profiles to clinically relevant responses was corroborated using β2-adrenergic receptor ligands.This research was supported by a research contract from Pfizer Inc. and grants from the Natural Sciences and Engineering Research Council of Canada (Grant 311997 to G.P.) and the Canadian Institutes of Health Research MOP 324876 (to G.P.), MOP 102630 (to M.B. and O.L.) and Foundation grant (FDN-148431) to MB. MB holds a Canada Research Chair in Signal Transduction and Molecular Pharmacology. Dr Lichtarge’s research was supported by National Institutes of Health (NIH 2R01 GM066099; NIH 5R01 GM079656). B.B. was supported by a studentship from Fonds de Recherche en Santé du Québec. P.D. was supported by a MITACS fellowship

    Evolutionary Action Score of TP53 Identifies High-Risk Mutations Associated with Decreased Survival and Increased Distant Metastases in Head and Neck Cancer

    Get PDF
    TP53 is the most frequently altered gene in head and neck squamous cell carcinoma, with mutations occurring in over two-thirds of cases, but the prognostic significance of these mutations remains elusive. In the current study, we evaluated a novel computational approach termed evolutionary action (EAp53) to stratify patients with tumors harboring TP53 mutations as high or low risk, and validated this system in both in vivo and in vitro models. Patients with high-risk TP53 mutations had the poorest survival outcomes and the shortest time to the development of distant metastases. Tumor cells expressing high-risk TP53 mutations were more invasive and tumorigenic and they exhibited a higher incidence of lung metastases. We also documented an association between the presence of high-risk mutations and decreased expression of TP53 target genes, highlighting key cellular pathways that are likely to be dysregulated by this subset of p53 mutations that confer particularly aggressive tumor behavior. Overall, our work validated EAp53 as a novel computational tool that may be useful in clinical prognosis of tumors harboring p53 mutations

    Accurate Protein Structure Annotation through Competitive Diffusion of Enzymatic Functions over a Network of Local Evolutionary Similarities

    Get PDF
    High-throughput Structural Genomics yields many new protein structures without known molecular function. This study aims to uncover these missing annotations by globally comparing select functional residues across the structural proteome. First, Evolutionary Trace Annotation, or ETA, identifies which proteins have local evolutionary and structural features in common; next, these proteins are linked together into a proteomic network of ETA similarities; then, starting from proteins with known functions, competing functional labels diffuse link-by-link over the entire network. Every node is thus assigned a likelihood z-score for every function, and the most significant one at each node wins and defines its annotation. In high-throughput controls, this competitive diffusion process recovered enzyme activity annotations with 99% and 97% accuracy at half-coverage for the third and fourth Enzyme Commission (EC) levels, respectively. This corresponds to false positive rates 4-fold lower than nearest-neighbor and 5-fold lower than sequence-based annotations. In practice, experimental validation of the predicted carboxylesterase activity in a protein from Staphylococcus aureus illustrated the effectiveness of this approach in the context of an increasingly drug-resistant microbe. This study further links molecular function to a small number of evolutionarily important residues recognizable by Evolutionary Tracing and it points to the specificity and sensitivity of functional annotation by competitive global network diffusion. A web server is at http://mammoth.bcm.tmc.edu/networks

    Rapid detection of similarity in protein structure and function through contact metric distances

    Get PDF
    The characterization of biological function among newly determined protein structures is a central challenge in structural genomics. One class of computational solutions to this problem is based on the similarity of protein structure. Here, we implement a simple yet efficient measure of protein structure similarity, the contact metric. Even though its computation avoids structural alignments and is therefore nearly instantaneous, we find that small values correlate with geometrical root mean square deviations obtained from structural alignments. To test whether the contact metric detects functional similarity, as defined by Gene Ontology (GO) terms, it was compared in large-scale computational experiments to four other measures of structural similarity, including alignment algorithms as well as alignment independent approaches. The contact metric was the fastest method and its sensitivity, at any given specificity level, was a close second only to Fast Alignment and Search Tool—a structural alignment method that is slower by three orders of magnitude. Critically, nearly 40% of correct functional inferences by the contact metric were not identified by any other approach, which shows that the contact metric is complementary and computationally efficient in detecting functional relationships between proteins. A public ‘Contact Metric Internet Server’ is provided

    Separation of Recombination and SOS Response in Escherichia coli RecA Suggests LexA Interaction Sites

    Get PDF
    RecA plays a key role in homologous recombination, the induction of the DNA damage response through LexA cleavage and the activity of error-prone polymerase in Escherichia coli. RecA interacts with multiple partners to achieve this pleiotropic role, but the structural location and sequence determinants involved in these multiple interactions remain mostly unknown. Here, in a first application to prokaryotes, Evolutionary Trace (ET) analysis identifies clusters of evolutionarily important surface amino acids involved in RecA functions. Some of these clusters match the known ATP binding, DNA binding, and RecA-RecA homo-dimerization sites, but others are novel. Mutation analysis at these sites disrupted either recombination or LexA cleavage. This highlights distinct functional sites specific for recombination and DNA damage response induction. Finally, our analysis reveals a composite site for LexA binding and cleavage, which is formed only on the active RecA filament. These new sites can provide new drug targets to modulate one or more RecA functions, with the potential to address the problem of evolution of antibiotic resistance at its root
    • …
    corecore